CUDA C++ Debugging: Safer GPU Kernel Programming (Generative AI Programming in C++) by Spuler David

CUDA C++ Debugging: Safer GPU Kernel Programming (Generative AI Programming in C++) by Spuler David

Author:Spuler, David
Language: eng
Format: epub
Publisher: Aussie AI Labs
Published: 2024-10-15T00:00:00+00:00


Let’s examine these issues in turn.

Threads-per-Block Multiple of 32

The number of threads per block (aka the “block size”) should be a multiple of the warp size, which is 32 threads. Hence, it can be as low as 32, but commonly recommended block sizes in real-world kernels are often 256 or 512. The maximum permitted by CUDA is 1024 threads per block.

This is not good:

float v[54];

...

int blocks = 2;

mykernel<<<blocks, 27>>>(v,n); // BAD

This might actually work, but it’s very inefficient, and offends the sensibilities of any experienced CUDA programmer, for reasons discussed below.

But first, note that this is worse:

float v[54];

...

int blocks = 54;

mykernel<<<blocks, 1>>>(v,n); // BAD

If the threads per block is not 32, or a multiple of 32, there will be odd threads in a warp that aren’t properly utilized (or might be doing the wrong thing). The reason is that CUDA allows threads in “warps” that contain exactly 32 threads. With the threads per block of 27, there were 5 extra threads, and for 1 thread per block, there were 31 wasted threads. So, instead, you want something more like this:

float v[64];

int n = 64;

...

int blocks = 2;

mykernel<<<blocks,32>>>(v,n); // BETTER

Or you can do this:

float v[64];

int n = 64;

...

int blocks = 1;

mykernel<<<blocks,64>>>(v,n); // BETTER

CUDA can only schedule 32 threads (a warp) at a time, and if you try to schedule less than that, the rest of the threads in that warp still run, which is a bug or a slug, or are unavailable to run, which is a slug.

Too Few Blocks

You don’t always know the incoming size N of your vector data structure (well, actually, you often do in AI engines, because they have static dimensions, but anyway). Let’s try to generalize our computation of how many blocks with each having a fixed number of threads. There’s a few basic points:

Each block has the same number of threads (i.e., the threads-per-block)

You can’t run half a block (all its threads will run, even if you don’t need that many).



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Popular ebooks
Whisky: Malt Whiskies of Scotland (Collins Little Books) by dominic roskrow(56004)
What's Done in Darkness by Kayla Perrin(26586)
The Fifty Shades Trilogy & Grey by E L James(19073)
Shot Through the Heart: DI Grace Fisher 2 by Isabelle Grey(19053)
Shot Through the Heart by Mercy Celeste(18930)
Wolf & Parchment: New Theory Spice & Wolf, Vol. 10 by Isuna Hasekura and Jyuu Ayakura(17104)
Python GUI Applications using PyQt5 : The hands-on guide to build apps with Python by Verdugo Leire(16973)
Peren F. Statistics for Business and Economics...Essential Formulas 3ed 2025 by Unknown(16865)
Wolf & Parchment: New Theory Spice & Wolf, Vol. 03 by Isuna Hasekura and Jyuu Ayakura & Jyuu Ayakura(16813)
Wolf & Parchment: New Theory Spice & Wolf, Vol. 01 by Isuna Hasekura and Jyuu Ayakura & Jyuu Ayakura(16437)
The Subtle Art of Not Giving a F*ck by Mark Manson(14345)
The 3rd Cycle of the Betrayed Series Collection: Extremely Controversial Historical Thrillers (Betrayed Series Boxed set) by McCray Carolyn(14125)
Stepbrother Stories 2 - 21 Taboo Story Collection (Brother Sister Stepbrother Stepsister Taboo Pseudo Incest Family Virgin Creampie Pregnant Forced Pregnancy Breeding) by Roxi Harding(13609)
Scorched Earth by Nick Kyme(12760)
Drei Generationen auf dem Jakobsweg by Stein Pia(10960)
Suna by Ziefle Pia(10885)
Scythe by Neal Shusterman(10331)
International Relations from the Global South; Worlds of Difference; First Edition by Arlene B. Tickner & Karen Smith(9517)
Successful Proposal Strategies for Small Businesses: Using Knowledge Management ot Win Govenment, Private Sector, and International Contracts 3rd Edition by Robert Frey(9360)
This is Going to Hurt by Adam Kay(9165)